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ABSTRACT 

Five methods for studying the process of reading a 
text are presented, and a model for discourse processing is outlined. 
Discourse processing refers to comprehension of the meaning 
underlying the verbal message. The methods discussed here Investigate 
the reading process as it occurs, and focus on the amount of time 
taken to complete a task or function. They Include paragraph reading 
time, sentence reading time, word reading time, eye movement 
recording, amd the use of a reader-controlled text window on the 
computer screen to measure reading speed in relation to text type and 
structure. Advantages and disadvantages of the methods are noted. 
Four concepts needed to analyze discourse processing are defined, 
including semantic coherence, theme-rheme structure, subject-matter 
knowledge, and reading purpose. Then the process of discourse 
comprehension is outlined, with reference to four types of coherence: 
referential, withln-sentence, topical, and functional. Five 
hypotheses concerning the processing of specific discourse structures 
are proposed. A 16~ltem bibliography is included. (MSB) 
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In this paper^ five methods for studying the process of readlna a text 
' are presented » and a model for discourse processing Is outlined. The 
model views the process as reader- text Interaction, where the Immediate 
processing of text Is controlled by textual coherence (withln-sentence, 
referential, topical^ and functional), combined with reading strategy 
and reading purpose. 



1. Introduction 



The psychology of reading has become a field of scientific Investiga- 
tion which attracts researchers from various scientific disciplines. This Is 
reflected In the growing nun^ber of studies published In the last decade (see 
Welntraub et a1. 1982). The Increased Interest In exploring the reading 
process can be seen as a consequence of two circumstances: 

(1) An adequate reading skill Is a necessary prerequisite for coping 
with the growing body of written Information In present-day society. This 



Oo 



1 This research was supported by grant no. 09/324 from the Research Council 
for the Social Sciences of the Academy of Finland to Pekka Niemi. 



3 2 

BEST COPY AVAILAME 



- 44 - 



hM 1t«d to thf adoption of tht concept of 'functional llllttracy*. It 
rtf«rs to casts Mhtre the readar cannot conprahtnd Mhat ht Is reading. In 
spite of having obtained a skill of encoding graphic syil>o1s Into words. In 
other words, a functionally Illiterate person Is unable to acquire Informa- 
tion froM written texts. 

(2) Cognitive psychology, which deals with human Information process- 
Inr, has developed Into a dominant paradigm In the field of psychology, 
especially In experimental psychology. The process of reading Is an obvious 
subject matter for this Informatlon-proceising psychology. 

In addition to psychologists, tht exploration of discourse processing 
Is becoming Increasingly popular among researchers In text linguistics and 
educational science. 

Consequently, there Is a practical need for scientific knowledge con- 
cerning the reading process; In addition there exists a promising scientific 
framework for Investigating It. 

The present paper focuses on discourse processing. 'Discourse* Is 
usually taken to mean a body of coherent Information that Is communicated 
either orally or In writing. Discourse processing entails the comprehension 
of the meaning underlying the verbal message of the discourse. In this paper 
the term 'discourse' refers to written messages only. 

Basically there are two different ways to study discourse processing. 
Firstly, one may study the end product of the process. I.e., the memory 
representation. A second approach Is to monitor the process "on line". 
Memory studies can be carried out by asking the reader to recall the text 
after he has finished reading It, or by presenting him with certain aspects 
of the text for recognition, or by asking questions about the content of the 
text. 

A nuRri)er of methods can be employed to monitor discourse processing on 
line. Five of them are Introduced In the following. 

2. On-line methods In the study of discourse processing 

All on-line methods are based on the Idea of mental chronometry. The 
idea is firmly established In the area of experimental psychology. It is 
founded on the fact that any kind of information processing takes place in 
time. Consequently, conclusions about processing can be drawn from the time 
expended on a task. To give a simple example. It takes less time to add 5 
and 2 than to add 7 and 2. So it can be concluded that the former addition 
is performed more easily. 
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A second tsstntitl ch«r«cttri$tic of on-lint mtthodii i$ th«t they make 
the simultaneous tracking of the readinq process feasible. In other words, 
by Wlyi^g on-line method, the researcher is able to monitor the pro- 
cessing of a written text as it is going on. 

A third characteristic feature of on-line methods is that the re- 
searcher is not forced to tamper with the on-going process in order to 
collect data. The obtrusions I am refering to usually appear in the form of 
subsidiary tasks unrelated to normal reading. Reacting to an auditive 
signal, continuous guestion answering and filling up missing words are exam- 
ples of such tasks. 

On-line methods for studying discourse processing can be divided into 

five categories: 

2.1. Paragraph reading time 

As the n«i« indicates, this method involves measuring the reading time 
for an entire paragraph of text. It is a relatively crude procedure of moni- 
toring the on-line comprehension of discourse, because it is unable to re- 
veal any characteristics of reading process inside the paragraph. 

Thus Britton. Heyer. Simpsoit. Holdredge and Curry (1979) could not find 
any evidence that the semantic macrostructure of the text affects immediate 
processing. Britton et al. included the same paragraph in two different 
texts. In one text the target paragraph was positioned high in the content 
hierarchy, that is. it was a central segment of the topic. In the other text 
the target paragraph had a subordinate status in the macrostructure. 

It was hypothesised that the target paragraph would be processed i«re 
deeply when it is a central part of the topic, and consequently its reading 
time would be longer. However, this was not the case: the target paragraph 
required an egual amount of processing time in both texts. 

The method seems relatively unsophisticated, and its application is 
rather restricted. 

2.2. Sentence reading time 

A more precise way to track discourse processing in real time is to 
measure reading time per sentence. In practice, measurement takes place with 
the aid of a computer, which displays the text on a video screen one sen- 
tence at a time. The presentation is self-paced by the reader, who proceeds 
through the text by pressing the button which causes the next sentence to 
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8pp«ar on the screen. The conputer simultaneously records sentence reedinq 
times, i.e., the time intervals between successive button presses. 

As an example of this kind of measurement, I briefly describe a study 
reported by Sanford and Garrod (1978). They investigated how the semantic 
relation in anaphoric reference between the anaphor and the antecedent 
affects discourse processing. The following is an example of their materi- 
als: 

(1) A bus came trundlinq down the hill. 
(TS) The vehicle almost hit a pedestrian. 

(2) A tank came trundling down the hill. 
(TS) The vehicle almost hit a pedestrian. 

The tarqet sentence (TS) is identical in both contexts. Sanford and 
Garrod inferred that if the quality of the semantic bridge between the 
anaphor ('the vehicle') and the antecedents (either 'a bus' or 'a tank') 
controls processinq, there should be a difference in target sentence readinq 
time in these two contexts. The reason is that praqmatic knowledge should 
ma;^e it easier for the reader to rsqard 'a bus' as a class mentoer of the 
concept 'vehicle' in contrast with 'a tank'. This was in fact the case. In 
the latter context the target sentence took longer to read than in the 
former context. 

2.3. Word readina time 

A still more precise way to monitor discourse processing on line is to 
measure readinq time per word. The measurement is also performed using a 
computer. The computer displays the text one word at a time. By pressing the 
button the reader is able to proceed to the next word of the text. The time 
between two button presses is the word readinq time. 

An example of this kind of research is the study by Mitchell and Green 
(1978), which explored, amonq other things, the processing of the relative 
clause, ■'^hey investigated whether it is more laborious to process the rela- 
tive clause when the relative pronoun is omitted than when It is explicitly 
expressed, as in 

(A) The tramp that the dog bit died. 

(B) The tramp the dog bit died. 
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It H«« hypothtslzed that tht clause 'the dog bif is easitr to read in 
sentence A where the relative pronoun is present than in sentence B where it 
is omitted. By W^y^ns the word reading time paradigm Mitchell and Green 
sought an answer for the issue. 

The result was that the relative clause took the same amount of time to 
process in both instances. It seems that for a competent reader the status 
of the relative clause in a sentence is clear enough even without a relative 
pronoun to signal it. 

2.4. Eye movement recording 

Recording the reader's eye movements while he is reading a text is a 
modern application of the old proverb >^ich speaks of the eye mirroring the 
mind. In other words, the method is based on the assumption that there 
exists a high correlation between the reader's attention and the location of 
his gaze on the line of print. 

Eye movements involve two things: (I) fixations, during *i*tich the eyes 
stay fixated on a certain word, and (2) saccades. which are short, rapid 
movements between fixations. The processing of text takes place during the 
fixations. Saccades serve the function of bringing new segments of the text 
into the fovea. Vision is clearest in the area of the text which falls into 
the fovea; furthermore, only the information located in the fovea at the 
moment can be processed to the semantic l?vel. Saccades »*«ich shoot back- 
wards in the text are called regressions. 

The following example illustrates eye movement data in discourse pro- 
cessing. It is taken from a study by Carpenter and Just (1981). 
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Tht upper nuii*«r cxprtssts tht ordtr of the fixation, and the lower 
number qlves Its duration (In msec). 

As an example of eye movement research I mention a study by Frazler and 
Rayner (1982). who Investigated the ambiguities that can be created by 
different syntactic structures. For example: 

Before the king rides his beautiful white horse Is always groomed. 

It takes a while for the reader to realize that 'before the king rides' 
forms a clause. It was the Intention of the authors to find out how the pro- 
cessing of such «nb1guit1e$ takes place. For that purpose Frazier and Rayner 
recor' i readers' eye movements. The parsing strategy of a typical reader is 
to integrate new words with the clause currently being processed ti^ienever 
this is possible. In the example, the last part of the sentence (i.e.. 
is always groomed') cannot be integrated with the clause Just completed. 
According to the study, this usually caused long fixations in that region, 
indicating that the last few words were laborious to comprehend. The long 
fixations were usually followed by a regression and a re-fixation in the 
region where the real clause boundary is located (i.e. rides/ his 

...'). In other words, the typical reader acts as though the sentence makes 
up one single clause. When the last words of the sentence are encountered, 
he has to reconstruct the syntactic structure of the sentence. This process 
is reflected in the eye movements. 

However, there was no uniformity of the parsing strategy adopted by 
subjects. Individuals varied in the way the structural ambiguities of the 
target sentence were resolved. 

2.5. Textwindow 

The "textwindow" system, developed at the University of Umei in Sweden 
(Jarvella and Lundberg 1984). is a recent methodological addition to dis- 
course processing research. The method operates as follows. The system is 
implemented on a computer which controls the movements of a "window" through 
which the reader can see segments of text on the computer screen. The window 
displays only a limited segment of the text (for example, a stretch of 8 
characters). The window moves along the lines of the text in the same direc- 
tion that the text would normally be read. New letters appear one by one 
from the right, causing those on the left to fall off. The reader is able to 
control the movement of the window by speeding it up. slowing it down. 
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stopping It, or reversing the run of the window In order to on backwards on 
the text. 

An Integral part of the procedure are the so-called 'probes'. I.e., 
auditive signals that are located In relevant text segments. The reader is 
expected to react to these slonals as fast as possible. The reaction time to 
the probes Is used as an estimate of the mental load of the reader. 

In their research program, Jarvella and Lundberq propose to use the 
textwindow method to Investigate the following kinds of guest Ions: How does 
the macrostructure of the text (a story grammar) control discourse process- 
ing? What role does pragmatic knowledge play In the Immediate processing of 
discourse? How are references resolved? 

The advantages and disadvantages of the methods presented above are 
currently the subject of much discussion. In most cases, comparisons seem to 
favour eye movement recording. There are four reasons for that: 

(1) The technlgue makes It feasible to expose a few sentences of the 
text at the same time. Conseguently, the reader Is allowed to progress 
rather naturally throuoh the text. 

(2) The reader can regress to earlier parts of the text. The regres- 
sions are also recorded. 

(3) Current technology allows eye movement recording which Is precise 
enough to track processing word by word. 

(4) The method does not compel the reader to fixate on every word (cf. 
the measure of word reading time). Instead, he may skip words as in normal 
reading. 

However, there are some problems with the method. Firstly, the data Is 
gulte laborious to analyze. Secondly, an accurate calibration of the device 
is difficult to achieve. 

Just, Carpenter and WooUey (1982) compared the method of measuring 
reading time per word with the eye-fixation paradigm, using eye movement 
data as the base line. They report a moderate correlation between the 
methods. Thus it may be concluded that the technigue of measuring word 
reading times approaches eye movement recording as a measure of processing 
time expended on successive words cf a text. 

In the next section I propose a frawework for studying the immediate 
processing of discourse. 
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3. An outline of the process of discourse comprehension 

To understand the message contained in a discourse, the reader must 
integrate the pieces of information inherent in the text with each other. 
The writer, on the other hand, should stage his message as coherently as 
possible. The framework I propose for discourse processing starts from this 
reader-writer interplay. In order to comprehend the message, the reader 
tries to maintain the semantic coherence between the textual propositions. 
In other words, the reader tries to build up a coherent semantic network of 
a text. Four different concepts are needed in analysing this process. 

(1) The concept of semantic coherence of text, in which three aspects 
may be distinguished: wi thin-sentence, referential, and topical coherence. 

(2) The concept of theme-rhewe structure of the text. The concept has 
to do with the distinction between old (theme) information and new (rheme) 
information (cf. Scinto 1978). Theme repeats information given in the 
earlier part of the text, while rheme introduces new information (for other • 
definitions of the concept, see Enkvist 1975). 

(3) The concept of subject-matter knowledge. This refers to the fact 
that there exist considerable individual differences in the semantic repre- 
sentations containing our knowledge of the world. As a concequence. individ- 
uals vary with respect to 'functional' coherence, i.e.. the extent to **»ich 
the semantic world explicated in a discourse resembles the semantic network 
of the reader (the reader's subject-matter knowledge). In other words, one 
reader might be more familiar with the topic discussed in a discourse than 
another. 

(4) The concept of the purpose of reading . The reader progresses 
through the text differently depending on what he wishes to extract from 
it. Thus it makes a difference whether a text is read for leisure or **iether 
it must be read in order to memorize as many details as possible, or to find 
an answer to a given question. The concept of 'reading strategy' refers to 
the same phenomenon, i.e.. the variation in the way readers construct an 
internal representation of a text. 

Vauras. von Wright and Kinnunen (1983) have investigated the learning 
strategies of high school pupils by studying recall protocols. They distin- 
guish between a holistic and an atomistic strategy. I propose that the con- 
cept of learning strategy as employed by Vauras et al. can also be applied 
to the inwediate processing of a disccurse. The proposal is based on the 
assumption that the reader is constantly integrating the content of the text 
into an internal representation. Thus, by applying a holistic strategy the 
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resdtr tries to build up a coherent representation by finding out the main 
topic and Inteqratinq the other propositions of the text with It. When the 
reader w^es «" atomistic strategy, he Is not bothered to construct a 
fully Integrated representation of the text. 

It may be argued that not every reader Is able to smoothly adapt his 
reading strategy according to various purposes of reading. The argument 
raises the Issue of nwtacognltlon. Reader's metacognltlons consist of his 
Ideas about what reading Is, and what constitutes learning from text. I.e., 
whether It Is memorizing the facts, understanding, problem solving, or some- 
thing else. In spite of Its Importancet this Issue will not be discussed 
further In the present connection. 

By putting the above concepts together. It Is possible to a propose a 
framework for the on-line processing of discourse. The framework of the con- 
trolling factors of discourse comprehension Is given In Figure 1. It applies 
only to a competent reader, whose word recognition processes have become 
automatized. The Intention Is not to Ignore word recognition and syntactic 
processes as controlling factors; they are excluded merely to keep the 
framework simple. 
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Figure 1. An outline of the controlling factors In the on-line processing of 
a text. 



The sentence (n) Is taken arbitrarily from the middle of a text being 
read. It consists of a theme and a rheme. The brings forward an Issue which 
Is known to the reader from the previous text. The semantic relation that 
exists between the theme and the previous text Is called referential coher- 
ence. The rheme of the sentence (n) Introduces new Information about the 
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Issue contained In the theme. The semantic relation between them and rheme 
is called w1 thin-sentence coherence. The rheme is also linked with the pre- 
vious text throuqh topical coherence. The connection between the world the 
writer is creating throuqh his text and the reader's semantic network Is 
here called functional coherence. 

These four types of coherence are Implicit in the discourse when the 
reader begins to extract information from it. It depends on the reading 
stra^egy how much the reader is affected by variations in the strength of 
coherence. Depending on the purpose of readinq, readers vary with respect to 
the amount of mental effort they put into malntaininq coherence throuqh 
reference resolution, integration of new information with aiven information^ 
and the utilisation of subject-matter knowledqe. 

In order to specify the proposal further, I suqqest some verifiable 
assumptions concerninq coherence effects on the immediate processing of dis* 
course. The assumptions deal with the situation where the reader applies a 
holistic strategy in order to process the text as completely as possible. 
They are all instances of the general idea that processing ease depends on 
how obvious the link is between a text segment and the semantic world of the 
previous text. 

The following hypotheses can be formulated: 

1. The sentence rheme is processed more fluently v*ien it can be associ- 
ated with the theme through inferences based on the textual world. When the 
association needs to be inferred from the external knowledge of the reader, 
a greater processing effort will be required. (Within-sentence coherence). 

2. The sentence theme is easiest to process if a close link exists 
between the theme and its antecedent. That Implies, for example, that pro* 
cessing is easier when reference is based on direct repetition, and more 
difficult when it is based on a substitution by a class member (cf. the 
bus-vehicle example above). (A qualitative aspect of referential coherency. 

3. The closer the theme and its antecedent are located to each other, 
the easier it will be to process the theme (cf. Kintsch & van 01 Jk 1978). In 
other words, it takes less effort to integrate the theme with an antecedent 
that is still active in the reader's mind, compared with a situation where 
the antecedent is to be re*act1vated, i.e., recalled. (Positional distance 
in referential coherence). 

4. If the information Inherent in the rheme is semantically closely 
linked with the main topic of the text, it is easier to integrate the sen<* 
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ttnce rheme with preceding text. (A Qualitative aspect of topical coher- 
ence). 

S. In the case where the rheme Is not to be Integrated with the topical 
Information of a text but with a piece of thematlcally subordinate Informa- 
tion, the following might be assumed: If the Information with which the 
rheme Is to be Integrated Is active In the reader's mind, the sentence rheme 
is easier to process. Information Is supposed to be more active If It has 
appeared recently In the text. (An aspect of positional distance in topical 
coherence). 

As was mentioned above, the assumptions are thouoht to apply to the 
case where the text Is processed as fully as possible. In other words, they 
are unlikely to be valid If the reader employs an atomistic strategy. With 
respect to the atomistic strategy the following general argument might be 
true: The less obvious are the connections between two text segments, the 
less this semantic relation affects text processing, that Is, when the 
atomistic strategy Is applied, the reader does not pel in extra effort to 
build up coherence In those sections of a text where this would be regulred. 

As an example of a study on the gualltatlve aspect of topical coherence 
(cf. hypothesis 4 above), consider Tommola (1985). In one his experi- 
ments, two types of text were compared, which were similar In their refer- 
ential coherence but dissimilar In their topical coherence. Referential 
coherence was controlled by using the same referent repeatedly as sentence 
theme In both types of text. The dissimilarity of topical coherence was 
created by writing type A texts in terms of a single 'frame' or 'scenario', 
while the sentences of type B texts kept shifting the scenario, and thus 
prevented the emergence of a discourse topic. 

Tommola Included In both text versions an Identical target sentence 
towards the end of the texts. In version A It explicitly expresses the topic 
of the text; In version B It contains Just another piece of Information 
about the topical referent. According to the fourth hypothesis (above), the 
rheme of the target sentence should be easier to process In text A than In 
text B because In version A It Is semantical ly closely related to the 
topic. This should not be the case In version B. Tommola's investigation 
confirmed the hypothesis: the target sentence was read faster in text A than 
In text B. 

The model outlined above reflects the modern 'Zeitgeist' of research In 
reading. As a result of a drastic change In research orientation In the 
1970 's, the focus Is n^w on meaning and comprehension. However, the process 
of meaning extraction is mainly studied throuah recall protocols. Conse- 



ERIC 




- 54 - 



quently, not very much is known about how various text and reader charac- 
teristics affect the immediate semantic processing of written lanquaqe. The 
model concentrates on this matter. 

On the whole, why should we know anything about the iftwediate process- 
ing of discourse? I see two potentially important reasons, both of **»ich 
stem from the idea of text processing as reader-text interaction. The first 
is associated with the reader, the second with the text. 

Considering the issue from the reader's point of view, knowledge of the 
on-line processing of written discourse can be applied to reveal individual 
differences in comprehension processes. On-line studies will tell us about 
the effective and flexible reading strategies of competent readers. They 
will also aid the diagnosis of potential reasons for the difficulties that 
some readers experience in extracting information from text. 

Investigating the role of the text in controlling the comprehension 
process can reveal the possible effects of various semantic structures on 
inroediate processing. This knowledge can be taken into account, for example, 
in the production of learning material. 

In our Department we have recently started a research project on 
readers' eye movements. We expect that this promising method will provide us 
with relevant facts about the on-line processing of discourse. 
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